Mixed Precision, FP16, WMMA, Matrix Multiplication, Deep Learning Acceleration

zFLoRA: Zero-Latency Fused Low-Rank Adapters
arxiv.org·1d
ONNX Runtime
Flag this post
A hitchhiker's guide to CUDA programming
seanzhang.me·1d·
Discuss: Hacker News
🎯GPU Kernels
Flag this post
Squeezing AI into Tiny Spaces: The Integer Revolution
dev.to·2d·
Discuss: DEV
📉Model Quantization
Flag this post
TinyML is the most impressive piece of software you can run on any ESP32
xda-developers.com·1d
ONNX Runtime
Flag this post
GIR-Bench: Versatile Benchmark for Generating Images with Reasoning
paperium.net·1d·
Discuss: DEV
🏎️TensorRT
Flag this post
An intro to the Tensor Economics blog
lesswrong.com·3d
🏎️TensorRT
Flag this post
The Role of GPUs in Accelerating Deep Learning Training
acecloud.ai·2d·
Discuss: DEV
🔗NCCL
Flag this post
The next RISC-V processor frontier: AI
edn.com·1d
🧠CPU Architecture
Flag this post
Inference Acceleration from the Ground Up
semiwiki.com·3d
🧠CPU Architecture
Flag this post
Review of Intel-based UP AI development kits – Part 1: Unboxing and first boot to Ubuntu Pro 24.04
cnx-software.com·10h
🔍Nsight
Flag this post
Duality-Based Fixed Point Iteration Algorithm for Beamforming Design in ISAC Systems
arxiv.org·1d
🔗Kernel Fusion
Flag this post
Show HN: Fast-posit, sw implementation of posit arithmetic in Rust
github.com·2d·
Discuss: Hacker News
🔍Type Checkers
Flag this post
Sparse Adaptive Attention “MoE”: How I Solved OpenAI’s $650B Problem With a £700 GPU
medium.com·4d·
Flash Attention
Flag this post
Opportunistically Parallel Lambda Calculus
dl.acm.org·1d·
Discuss: Hacker News
💡LSP
Flag this post
AI efficiency advances with spintronic memory chip that combines storage and processing
techxplore.com·3d
Flash Attention
Flag this post
Accelerating AI inferencing with external KV Cache on Managed Lustre
cloud.google.com·1d
ONNX Runtime
Flag this post
VerfCNN, Optimal Complexity zkSNARK for Convolutional Neural Networks
eprint.iacr.org·2d
🧮cuDNN
Flag this post
How fast can an LLM go?
fergusfinn.com·2d·
Discuss: Hacker News
🏎️TensorRT
Flag this post
Custom Intelligence: Building AI that matches your business DNA
aws.amazon.com·1d
🤖AI Coding Tools
Flag this post
[D] Best (free) courses on neural networks
reddit.com·3h·
👁️Attention Optimization
Flag this post